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Subroutine  for  the 

Calculation  of  CODEN  Check  Characters 
David  Garvin 


A  FORTRAN  subroutine  is  described  that  computes  the  check  character 
for  an  ASTM  CODEN  for  Journal  Titles.   This  routine,  written  for  input 
in  Hollerith  characters,  is  adaptable  to  other  coding  schemes.   A  listing 
of  the  routine  is  provided . 

Keywords:  Check  character;  CODEN;  computer  program;  journal 
abbreviations . 

1.     Introduction 

CODEN  for  journal  titles  are  abbreviations,  such  as  JCPSA,  that 
provide  a  completely  defined,  short,  frequently  mnemonic  identifier 
for  a  serial  publication.   This  system  is  maintained  by  the  American 
Society  for  Testing  and  Materials  and  is  used  both  by  information  systems 
and  some  scientific  journals. 

When  other  identifiers  are  added  to  the  CODEN  a  compact  specific 
reference  to  an  article  can  be  obtained.   E.g. 

Schofield  IJCKBO-1972-4-255 

identifies  an  article  by  K.  Schofield,  Int.  J.  Chem.  Kinetics,  (1972) 
vol.  4  pg.  255.   In  a  similar  manner  the  designation 

IJCKBO  4(3)  255-336  (1972) 

is  used  as  a  "bibliographic  strip"  to  identify  the  page  contents  of 
issue  no.  3  of  volume  4.  This  is  useful  in  accession  procedures  in 
libraries . 

The  sixth  character,  "0"  above,  is  a  "check  character"  formed 
from  the  first  five  by  a  unique  algorithm.   It  is  used  to  spot 
transcription  errors  when  the  CODEN  is  keyboarded. 


In  this  note  a  FORTRAN  subroutine  is  provided  that  calculates 
and  verifies  check  characters  for  CODEN.   The  technical  description 
starts  in  section  2  and  the  program  listing  is  in  section  4.   The 
application  of  the  routine  is  discussed  below.   The  algorithm  given 
in  the  ASTM  CODEN  publication  and  used  here  is  stated  at  the  end  of 
this  section. 

Application   There  are  three  types  of  potential  uses  for  this  sub- 
routine . 

(1)  It  may  be  used  to  generate  check  characters  for  CODEN  for 
use  in  an  information  system.  At  present  these  characters  are  not  given 
in  the  ASTM  publication. 

(2)  It  may  be  used  to  check  the  validity  of  CODEN  keyboarded 
in  constructing  an  information  system.   This  is  the  most  likely  use  in 
a  small  system. 

(3)  It  may  serve  as  a  benchmark  routine  against  which  the  per- 
formance of  more  efficient  routines  can  be  tested.   This  is  the  most 
likely  use  in  a  large  system. 

The  ASTM  algorithm  for  check  characters  [1]  is  quoted  below: 

"CHECK  CHARACTERS 

"The  following  information  is  included  about  check  characters 
for  those  users  of  the  CODEN  system  who  may  wish  to  use  the  character. 
Its  purpose  is  to  provide  a  letter  or  digit  which,  when  properly  cal- 
culated, will  eliminate  errors  in  CODEN  notation  in  the  keyboarding 
stage. 

"The  initial  suggestion  for  the  use  of  a  machine -generated  check- 
letter  for  controlling  errors  in  CODEN  came  from  A.  D.  Pratt  at  the 
School  of  Library  Science,  University  of  Indiana,  Bloomington.   F.  E. 
Hajjar  is  responsible  for  reducing  the  idea  to  practice  for  Chemical 
Abstracts  Service.   The  check-character  becomes  a  sixth  letter  or 
digit  whose  value  depends  upon  value  and  order  of  the  five  elements 
of  the  CODEN.   Any  error  in  a  single  letter  or  an  inversion  of  letters, 
plus  most  other  types  of  errors,  will  produce  a  check  character  that 
is  inconsistent  with  the  correct  one  and  expose  the  error.   Following 
is  a  brief  description  of  the  system  as  used  at  Chemical  Abstracts 
Service. 


"1.    CODEN  may  be  entered  as  a  5-  or  6-character  field. 

(a)  If  the  CODEN  is  entered  as  a  5-character  field,  then  a 
check-character  is  generated  and  added  as  a  sixth  character. 

(b)  If  the  CODEN  is  entered  as  a  6-character  field,  then  a 
check-character  is  generated  from  the  first  five  and  matched  with  the 
sixth.   If  the  check-characters  match,  the  CODEN  is  verified  as  being 
valid.   If  the  check-characters  do  not  match,  then  the  generated 
character  replaces  the  original  check-character  and  a  switch  is  turned 
on  to  be  interrogated  by  the  operating  program. 

"2.    The  check-character  is  generated  as  follows: 

(a)  Each  alpha-numeric  character  of  the  CODEN  is  replaced  with 
an  equivalent  value.   The  equivalents  are: 

CODEN:     A,  B, Y,  Z,  1,  2, 9,0 

Equivalent:     1,  2, 25,  26,  27,  28, 35,  36 

(b)  The  equation  used  to  generate  the  check-character  is : 

(llxNp  +  (7xN2)  +  (5xN3)  +  (3xN.)  +  (lxN  )  =  X  +  remainder 

34 

34 

Where  N, ,  N~,  etc.  are  the  equivalents  of  the  CODEN  characters  in  order 
of  their  appearance  in  the  CODEN  and  X  is  a  whole  number  that  is  dis- 
carded . 

(c)  The  remainder  is  converted  to  a  check-character  by  the 
following  set  of  equivalents: 

remainder:   1,  2, 25,  26,  27,  28,  29, 33,  34 

[34  =  0] 

Check-character:   A,  B,  .  . .  .  Y,  Z,  2,  3,  4,....  8,  9 
[or  zero,  [sic]) 

The  numeric  check-characters  one  (1)  and  zero  (0)  have  been  eliminated 
to  avoid  confusion  with  the  alphabetic  characters  I  and  0. 


(d)   Sample  calculation  of  a  check  character  for  the  CODEN 
JACS-A: 


N1 

= 

J 

= 

10 

N2 

= 

A 

= 

1 

N, 

= 

C 

= 

3 

N4 

= 

s 

= 

19 

N5 

= 

A 

= 

1 

Substituting  these  equivalents  for  the  characters  in  the  CODEN  into 
the  equation  yields  an  "X"  of  5  and  a  remainder  of  20.   The  check- 
character  equivalent  of  20  is  T.   Thus  the  complete  CODEN  with  check- 
character  is  JACS-AT." 


The  rule  as  applied  in  this  program  calculates  X  then  uses  the  formula 

(X-l)  Mod  34  +  1  =  C 

where  C  is  the  check  character.   This  rule  is  implied  in  paragraph 
2  c  in  the  quotation,  where  a  remainder  of  zero  must  be  assigned  to 
the  check  character  nine  (9) . 

2.     GENERAL  DESCRIPTION  OF  PROGRAM 

Name:    Subroutine  CODCHK  (LINE,  MINE,  IS,  IL,  IERR) 
Language:   Fortran  IV  with  "implied  do  loop"  DATA  statements 
Purpose:    Calculation  of  (or  verification  of)  the  check 
character  of  a  CODEN  for  Periodical  Titles.   Uses  the  algorithm  stated 
in  ASTM  DS  23B,  vol.  1  pg .  ix. .  [1]. 

Formal  Arguments:   LINE :   Integer  array 

MINE:   Integer  array,  6  words  minimum 
IS,  IL,  IERR:   Integers 

Input  Required:   The  CODEN  is  to  be  processed  is  suppled  to  the 
routine  in  the  array  LINE,  one  character  per  word.   It  is  located  in 
the  zone  LINE  (IS)  to  LINE  (IL)  where  IS  and  IL  are  the  left  and  right 
limits,  IS<IL.   It  is  the  only  item  in  this  zone.   In  this  version  the 
characters  are  assumed  to  be  Hollerith,  in  Al  (left  adjusted)  format. 

The  first  six  non-blank  characters  are  processed.   Embedded 
blanks  and  minus  signs  are  ignored.   The  sixth  character  may  be  missing. 
Examples  of  acceptable  CODEN  are 

JACSA,      JACS-A,    JACS   A,    JACSAT, 
J  A  C  S   A  T 


Signals  returned:    Input  errors  are  signalled  by  a  non-zero  value 
of  IERR  returned  by  the  routine 

IERR  =  0  Successful  calculation 

=  1  Check  character  provided  at  input  did  not  match 

the  calculated  one 

=  2  There  were  more  than  six  characters  in  the  input 

(first  six  used) 

=  4  There  were  fewer  than  five  legal  characters  in  the 

input  (no  calculation  made) 

=  8  Illegal  character  in  the  input  (ignored) 

These  signals  are  additive. 

Text  returned:    The  routine  returns  a  six  character  CODEN  in  MINE, 
one  character  per  word,  Al  format.   The  sixth  character  is  the  check 
character  (except  when  IERR=4,  in  which  case  MINE  (6)  is  left  blank) , 

3 .     REMARKS : 


Subroutine  operation:    The  input  characters  are  converted  to 
integers  using  a  table  search  of  a  dictionary,  INDICT.   The  word 
INDICT  (K)  contains  the  character  that  corresponds  to  the  integer  K 
in  the  algorithm.   A  similar  dictionary,  J0DICT,  is  used  to  convert 
the  integer  value  of  the  check  character  to  the  character  itself. 
Both  dictionaries  in  this  version  contain  left  adjusted  Hollerith 
characters.   Other  character  codes  may  be  substituted.   The  two  constants 
used  in  the  program,  blank  and  minus,  are  defined  in  data. 

This  table  search  procedure  is  much  slower  than  a  table  look-up 
based  on  right  adjusted  integers.   Any  production  program  should  use 
the  latter  technique.   The  search  procedure  is  used  here  because  ANSI 
FORTRAN  does  not  provide  for  shifting  Hollerith  characters,  this  being 
the  language  feature  needed  if  the  procedure  is  to  be  changed. 

Relation  to  other  routines:    This  subroutine  must  be  called  into  action 
by  another  program.   The  essential  function  of  the  calling  program  is  to 
isolate  the  CODEN,  that  is,  to  set  pointers  (IS  and  IL)  showing  where  in 
the  array  LINE  the  CODEN  (and  only  it)  is  located.   The  calling  program 
must  also  dispose  of  the  result  returned:   print  error  messages,  replace 
the  old  with  the  new,  etc.   The  nature  of  this  calling  routine  cannot 
be  specified  here  since  it  must  be  constructed  to  match  the  file  of 
material  that  is  to  be  examined.   (However,  a  sample  is  included  in 
the  program  listing) . 


4.     PROGRAM  LISTING: 


Listings  of  two  routines  are  attached.   These  are  Subroutine 
CODCHK,  which  does  the  work,  and  Main  program  CODEN,  which  is  an  example 
showing  how  CODCHK  can  be  used  with  simple  card  input.   An  example  of 
the  output  follows  the  listings. 


[1]   Blumenthal,  J.  G.,  Karaman,  M.  and  Peters,  A.  "CODEN  for  Periodical 
Titles"  ASTM  Data  Series  DS  23B  (American  Society  for  Testing  and 
Materials,  Philadelphia,  1970),  Volume  1  pgs  ix-x. 


SUBROUTINE  CODCHK ( L I NE , MI NE, IS, IL 
DIMENSION  LINE( 1)  ,MINE(  1  ) 
SIZES  OF  LINE  AND  MINE  ARE  SET  IN  CAL 
MINIMUM  IS  6  WORDS  EACH 

DIMENSION  INDICT(36)  ,JODICT(34)  » 
ROUTINE  COMPUTES  A  CHECK  CHARACTER  FO 
THESE  FORMS 

JACSA     JACS-A     JACS  A    JACSA 

(EMBEDDED  BLANKS  AND  DASH  IGNOR 

ROUTINE   RECEIVES  SIX  CHAR  (Al  FORMAT 

SIX  CHAR  (Al  FORMAT)  IN  MINE.   THE  S 

CHECK  CHARACTER  CALC  PER   PG  IX-X 
THE  ARRAY  LINE  IS  NOT  CHANGED. 
IF  THE  INCOMING  CODEN  HAD  A  CHECK  CH 

THE  CALC  MADE  HERE. 
THE   FORMAL  ARGUMENT   -IERR-  IS  RETU 
AND  MEANINGS 


,  IERR) 

LING  PROGRAM 

LIN2(6) 

R  A  CODEN   WRITTEN  IN  ONE  OF 

T 

ED) 

)  IN  -LINE-    AND   RETURNS 

IXTH  CHAR  IN  MINE  IS  THE 

OF  ASTM  DATA  SER.  DS  23  B 

AR..THAT  IS  COMPARED  WITH 

RNED  WITH  THESE   VALUES 


OK 

CHECK  CHAR  DID  NOT  MATCH 

CODEN  HAD  MORE  THAN  6  CH 

CODEN  HAD  LESS  THAN  4 
ILLEGAL  CHARACTER  IN  COD 


IERR=0 
IERR=1 
IERR  =  2 
IERR  =  4 
IERR  =  8 
CONSTANTS 
DATA  IBLNK.IMIN/  1H  ,1H-/ 
DICTIONARIES 
INPUT       ABCDEFGHI JKLMNOPQRSTUVWXYZ1 
OUTPUT     ABCDEFGHIJKLMNOPQRSTUVWXYZ2 
INPUT  CHAR  DICTIONARY 

DATA  < INDICT! I ), 1=1,36)/ 

1  1HA»1HB,1HC,1HD,1HE»1HF,1HG,1HH,1 

2  1HQ.1HR,1HS»1HT,1HU,1HV,1HW,1HX,1 

3  1H1,1H2,1H3,1H4,1H5,1H6,1H7,1H8,1 
OUTPUT  DICTIONARY  FOR  CHECK  CHARACTER 

DATA( JODICT( I ) , 1=1 ,34)  / 

1  1HA,1HB,1HC,1HD,1HE,1HF,1HG,1HH,1 

2  1HQ,1HR»1HS,1HT,1HU.1HV,1HW,1HX,1 

3  1H2,1H3,1H4,1H5,1H6,1H7,1H8,1H9  / 
DO  10  1=1,6 
MINE(  I  )  =  1H 
LIN2( I )=0 
IERR=0 
MODE=0 
K  =  0 
IF  (IL-IS-4)  140,20,20 

SEARCH  INDICT  FOR  THE  CHARACTERS 

DO  80  I=IS,I L 

IF  f LINE( I J-IBLNK)  30,80,30 

IF  ( LINE( I )-IMIN)  40,80,40 

DO  50  J=l,36 

IF  ( LINE( I )-INDICT( J) )  50,60,50 

CONTINUE 
ILLEGAL  CHARACTER 

IF  (IERR.LT.8)  IERR=IERR  +8 

GO  TO  80 
C  CONVERT  VALUE  TO  AN  INTEGER 
60     K=K+1 

IF  (K.LE.6)  GO  TO  70 
C    MORE  THAN  6  CHAR  IN  FIELD 

IF  (MOD( IERR, 4) .LE.l)  IERR= 

GO  TO  90 
70     LIN2(K)=J 


ARACTERS  (FIRST  6  USED) 
CHARACTERS  (NO  CALC.  MADE) 
EN 


234567890 
3456789 


(36) 
(34) 


HI ,1HJ,1HK,1HL,1HM,1HN,1H0,1HP, 

HY,1HZ, 

H9,1H0/ 


HI,1HJ,1HK,1HL,1HM,1HN,1H0,1HP, 
HY,1HZ, 


10 


C 
20 

30 
40 

50 

c 


IGNORE  BLANKS  AND  MINUS 


IERR+2 


CDK 

1 

CDK 

2 

CDK 

2A 

CDK 

2B 

CDK 

3 

CDK 

4 

CDK 

5 

CDK 

6 

CDK 

7 

CDK 

8 

CDK 

9 

CDK 

10 

CDK 

11 

CDK 

12 

CDK 

13 

CDK 

14 

CDK 

15 

CDK 

16 

CDK 

17 

CDK 

18 

CDK 

19 

CDK 

20 

CDK 

21 

CDK 

22 

CDK 

23 

CDK 

24 

CDK 

25 

CDK 

26 

CDK 

27 

CDK 

28 

CDK 

29 

CDK 

30 

CDK 

31 

CDK 

32 

CDK 

33 

CDK 

34 

CDK 

35 

CDK 

36 

CDK 

37 

CDK 

38 

CDK 

39 

CDK 

40 

CDK 

41 

CDK 

42 

CDK 

43 

CDK 

44 

CDK 

45 

CDK 

46 

CDK 

47 

CDK 

48 

CDK 

49 

CDK 

50 

CDK 

51 

CDK 

52 

CDK 

53 

CDK 

54 

CDK 

55 

CDK 

56 

CDK 

57 

CDK 

58 

CDK 

59 

MINE(K)=LINE ( I ) 
80     CONTINUE 

IF  (K-5)  140,100,90 
90     M0DE=1 
100    MSUM=11*LIN2 ( 1) 

MSUM=MSUM+7*LIN2( 2  ) 

MSUM=MSUM+5*LIN2(3  ) 

MSUM=MSUM+3*LIN2(4) 

MSUM=MSUM+LIN2<5) 
C   COMPUTE   NUMERICAL  EQUIVALENT  OF  CHECK  CHARACTER 

ICK=MOD(MSUM-l»34)+l 

ICHECK=JODICT( ICK) 

IF  (MODE-1)  130»110*130 
C  COMPARE  WITH  EXISTING  CHECK  CHARACTER 


110 


C 

120 

130 

C 

140 

150 


JCK=LIN2 (6) 
JCHECK=INDICT( JCK) 
IF  ( ICHECK-JCHECK)  120,130*120 
INCONSISTENT  CHECK  CHARACTER 
IF  (MODI IERR.2) .LT.l)  IERR=IERR+1 
MINE(6)=ICHECK 
60  TO  150 
FEWER  THAN  5  CHARACTERS 

IF  (MOD( IERR,8) .LT.4)  IERR=IERR+4 

RETURN 

END 


CDK 

60 

CDK 

61 

CDK 

62 

CDK 

63 

CDK 

64 

CDK 

65 

CDK 

66 

CDK 

67 

CDK 

68 

CDK 

69 

CDK 

70 

CDK 

71 

CDK 

72 

CDK 

73 

CDK 

74 

CDK 

75 

CDK 

76 

CDK 

77 

CDK 

78 

CDK 

79 

CDK 

80 

CDK 

81 

CDK 

82 

CDK 

83 

CDK 

84 

10 

c 

20 


30 

40 

50 

C 

60 

70 


SAMPLE  MAIN  PROGRAM  FOR  CODEN  INPUT  FROM  CARDS 
DIMENSION   LINE(80)  .MINE180) 
CARD  READER  AND  PRINTER  UNITS 

KR  =  5 

KP  =  6 
ZONE  ON  THE  INPUT  WHERE  CODEN  IS  LOCATED 

11  =  1 

12=10 

PRINT  10 

FORMAT  (»1  OUTPUT        INPUT  FOR  CODEN  CHECK  PROGRAM') 
READ  THE  INPUT.   THIS  IS  UNIVAC  FORTRAN  V 

READ  <KR,40,END=70)  ( L I NE ( I ) , I = 1 , 80 ) 

CALL  CODC'HK  (  L  I  NE  ,  M  I  NE  ,  I  1  ♦  I  2  .  I  ERR  ) 

IF  ( IERR.EQ.O)  GO  TO  30 

WRITE  (KP',.50)  IERR 

WRITE  (KP.60)  (MINE(  I  )» 1  =  1.6)  .  (LINEt  I  ),  1  =  11,12) 

GO  TO  20 

FORMAT  (80A1) 

FORMAT  (•  *****   ERROR  TYPE" 13,  '  ON  NEXT  CARD') 
FORMAT  FOR  ARRAY  LINE  SHOULD  ALLOW  FOR  1 2-1 1+1  CHARACTERS 

FORMAT  (1H  6A1,6X,30A1) 

STOP 

END 


CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 
CDN 


1 
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6 

7 

8 

9 
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12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23- 


TEST   OF    SUBROUTINE    CODCHK    (TYPED    COPY    OF    OUTPUT) 


OUTPUT  INPUT   FOR   CODEN    CHECK    PROGRAM 

*****      ERROR    TYPE      4    ON   NEXT    CARD 
ABCD  ABCD 

*****      ERROR    TYPE      3    ON   NEXT   CARD 
ABCDEW  ABCDEFG 

ANYAA9  ANYAA 


APOPAI 

APOPA 

7v  «"  7v  77  7v 

ERROR   TYPE      1 

ON 

NEXT 

CARD 

CHREAY 

CHREAB 

***** 

ERROR   TYPE      8 

ON 

NEXT 

CARD 

GCAXZ6 

*GCAXZ 

«  w  Sv  w  « 

ERROR   TYPE      8 

ON 

NEXT 

CARD 

GCAXZ6 

GC*AXZ 

***** 

ERROR    TYPE    12 

ON 

NEXT 

CARD 

GCAZ 

GC*A*Z 

IJCKBO 

IJCKB 

JACSAT 

JACSA 

JACSAT 

JACS-A 

JACSAT 

J    A    C    SAT 

***** 

ERROR   TYPE      1 

ON 

NEXT 

CARD 

JASCAR 

JASCAT 

JATPA3 

JATPA 

JGREA2 

JGREA 

NBCIAG 

NBC  I A 

NBCIAG 

NBCIA-G 
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